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Abstract. A basic pigeonhole principle insures an existence of two objects of the same type if the 
number of objects is larger than the number of types. Can such a principle be extended to a more 
complex combinatorial structure? Here, we address such a question for graphs. We call two disjoint 
subsets A, B of vertices twins if they have the same cardinality and induce subgraphs of the same 
size. Let t(G) be the largest k such that G has twins on k vertices each. We provide the bounds on 
t(G) in terms of the number of edges and vertices using discrepancy results for induced subgraphs. 
In addition, we give conditions under which t(G) = \V(G)\/2 and show that if G is a forest then 
t(G)>\V(G)\/2-l. 

1. Introduction 

For a combinatorial structure G and a set of parameters, we say that two disjoint substructures 
are twins if those parameters coincide for each substructure. This notion generalizes the pigeonhole 
principle and was investigated in case of sequences and graphs. In the case of sequences it is known, 
see Axenovich, Person and Puzynina [1], that any binary sequence of length n contains two disjoint 
identical subsequences of length n/2 — o(n) each. Twins are defined to be two disjoint vertex subsets 
of the same size and the same multiset of pairwise distances were studied by Albertson, Pach and 
Young p], and by Axenovich and Ozkahya [3]. Edge-disjoint isomorphic subgraphs were also addressed 
in O [101 El . 

Here, we concentrate on the following twin problem in graphs introduced by Caro and Yuster in [6]. 
For a graph G, we call two disjoint subsets of vertices twins if they have the same size and induce 
subgraphs with the same number of edges. Let t(G) be the largest k such that there are twins A, B 
in G with \A\ = \B\ = k. Let 

t(n) = min{t(G) : \V(G)\ =n}. 
The best currently known bounds on t{n) are given in the following theorem. 

Theorem 1 (Caro and Yuster [6]). There exists a positive constant c such that ^/n < t(n) < n/2 — 
clog logn. 

Ben-Eliezer and Krivelevich, [9], proved that t(G(n,p)) = \n/2\ with high probability, where G(n,p) 
is the Erdos-Renyi random graph. In addition, Caro and Yuster proved that in a sparse graph there 
are twins of size almost n/2: 
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Theorem 2 (Caro and Yuster [6]). For every fixed a > and for every e > there exists N = N(a, e) 
so that for all n > N, if G is a graph on n vertices and at most n 2 ~ a edges then t{G) > (1 — e)n/2. 

In this paper, we do the following: 

• In Theorem [4j we improve the above result using a more general theorem on discrepancy for 
induced subgraphs. 

• In Theorem [5j we give several sufficient criteria under which the graph has perfect twins; i.e., 
two twins spanning the whole graph. 

• In Theorem [6j we prove that all but at most two vertices of any forest could be split into two 
twins. 

For disjoint vertex sets A, B in a graph G, we denote by e(A) the number of edges induced by A, 
e(A, B) the number of edges between A and B. The discrepancy between A and B is disc(A, B) = 
\e(A) — e(B)\. Let lgn denote the logarithm of n in base 2. The maximum and the minimum degree 
of a graph G are denoted by A(G) and 5(G), respectively, the set of vertices of degree i is Vi. A pair 
of vertices is called consecutive if the degrees of these vertices differ by exactly 1 or —1. 

Theorem 3. If G is an n vertex graph, n > 16, then there are vertex disjoint sets A and B such that 

• disc(A-B) < 21g 2 n and \A\ = \B\ > (n - 21gn)/2 or 

• disc(A, B) < (A(G) - 5(G) + l)/2 and \A\ = \B\ = [n/2\ . 

Theorem 4. If G is a graph on n vertices and e edges then t(G) > § ^1 — 20v ^ lg " ^ . 

This theorem implies, in particular, that any n- vertex graph on o (n 2 /lg 2 ra) edges has twins of size 
n/2 — o(n) each and that planar graphs have twins of size at least n/2 — clgn each. 



Theorem 5. Let G be a graph on n vertices, where n is even. If one of the following conditions 1) 4) 
holds then t(G) = n/2. 

1 ) The degree sequence of G forms a set of consecutive integers. 

2) \Vi\ is even for each i. 

3) n > 90 and \{i : \Vi\ is odd}\ > n/2. 

4) There are at least A(G) — 5(G) disjoint consecutive pairs of vertices. 
Theorem 6. If G is a forest then t(G) > [n/2] - 1. 

If n is odd this is clearly best possible. For even n this is attained, for example, by a star. Such a 
graph has no perfect twins. 

The rest of the paper is organized as follows: In Section [2j we cite and prove some useful observations 
about degree sequences and include a short proof of Theorem [T] for completness. Sections [3]-[6] contain 
the proofs of Theorems [3j|6j 

2. Preliminary and known results 

Here, we denote a degree of a vertex v in a graph G by d(G,v) or simply d(v). For a set A of 
vertices, d(G,A) = d(A) = J2 a eA^( a )- ^ ne se * °f an ^-element subsets of a set X is denoted by ("^). 
For a graph G and a vertex set W C V(G) the induced subgraph of G on V(G) \ W is denoted by 
G-W. 
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Lemma 7. Let G be an n-vertex graph and A,B disjoint vertex sets of equal size such that V(G) = 
A U B. If d(A) = d{B) then e(A) = e(B). If A is a disjoint union of A%, . . . , Ak and B is a disjoint 
union of B\, . . . ,Bk, such that \Ai\ = \Bi\ and \d{Ai) — d{Bi)\ < e, i = 1, . . . , k then there are disjoint 
vertex subsets A',B' of size n/2 each such that disc(A' , B') < e. 

Proof. First assume that d(A) = d{B). Then we have that 

e{A) - e{B) = (d{A) - e(A,B))/2 - (d(B) - e(A,B))/2 = (d{A) - d(B))/2 = 0. 

Now let A and B be the disjoint union of A±, . . . .Ak and Bi, . . . , Bk, respectively, such that \Ai\ = \Bi\ 
for each i = 1, . . . , k. Assume, without loss of generality, that — d(Bi)\ > (^(^2) — d{B2)\ > 

■ ■■ > \d{Ak) — d(Bk)\. Define, for each i = 1, ...,k, 

{Ai i is odd and d(Ai) > d(Bj), 
Bi otherwise. 

Let C[ = {Ai, Bi} - d and consider A' = d U • • • U C k and B' = C[ U • • • U C' k . Since e(A') = 
(d(A') - e{A',B'))/2 and e(B') = (d(B') - e(A',B'))/2, we have 2{e{A') - e(B')) = d(A') - d(B') = 
^2i = i(d(Ci) — (i(C-)). Since this sum has terms non-increasing in absolute value with alternating signs 
and bounded in absolute value by e, the sum itself is at most e. □ 

We shall need the following number theoretic result. 

Lemma 8. Let n > 16 and k > lgn. Let X be a multiset {a±, ci2, . . .} of integers from {0, 1, . . . , n— 1} 
such that for any I < k and for any disjoint index sets . . . , i{\ and {ji, . . . , j{\ the following holds: 
a«i + cii 2 + • • • + ai t 7^ Ojj + aj 2 + • • • + aj l . Then \X\ < 2k. 

Proof. Let S be the set of all possible sums of fc-element sub- multi-sets of X. Since we assume that 
all of these are distinct, IS"! = ('■? )■ On the other hand, there are at most k(n — 1) + 1 < kn possible 
sums of k elements from X. Thus, using a well-known consequence of Stirling's inequality, 

Consequently n > 2 2fc_1 //c 3//2 . This is a contradiction since k > lgn > 4. □ 

Here, we say that two integers are almost equal if they differ by 1, 0, or —1. 

Theorem 9 (Karolyi 0). Let X be a set of m integers, each between 1 and 2m — 2. If m > 89, then 
one can partition X into two sets, X\ and X2 of almost equal sizes such that the sum of elements in 
X\ is almost equal to the sum of elements in X2 ■ 

For positive integers k,n (k < n/2), the Kneser graph K, = KG{n, k) is the graph on the vertex set 
(^') whose edge set consists of pairs of disjoint Assets. Lovasz [11] proved that the chromatic number 
of the Kneser graph KG(n, k) is n — 2k + 2. This result is a main tool for proving the lower bound on 

t(n) used by Caro and Yuster [6]. Here we repeat it for completeness. 
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Outline of the Proof of Theorem [71 For a graph G with vertex set [n] , we define a vertex coloring of 
K, by letting the number of edges induced by each /c-subset of V(G) be the color of the corresponding 
vertex in KL. The number of possible colors is most (2) + 1. For k = y/n, we have (2) + 1 < n — 2k + 2. 
Thus the number of colors is less than the chromatic number of fC and there are two adjacent vertices 
in K of the same color. Therefore there are two disjoint A;-sets of G that induce the same number of 
edges. 

To verify the upper bound, consider the disjoint union of cliques of odd orders ai, . . . ,a m , where 
aj > 2(o| + a\ + • • • + a j-i)- O ne can show that any pair of twins in this graph must omit at least 
clglgn vertices for some constant c. □ 

3. Proof of Theorem [3] 

Set k = \gn and G = (V,E). We shall find two large vertex sets A and B of G of equal size and 
small discrepancy following the following procedure. 

Step 1. Choose disjoint sets A\,B\ C V such that \Ai\ = \Bi\ < k and d(A\) = d(B\). 

By Lemma [8] this is possible. 
Step i, i > 1. Choose disjoint sets A i} Bi C V\{A\UB\\1- •■Ui i _iUB,;-i) such that \A t \ = \B{\ < k 
and d(Ai) = d(Bi). 

Again, by Lemma [8] this is possible as long as \V \ (A\ U B\ U • • • U U -Bi-i)| > 2k. 

Assume that we have to stop after step q and let S = V \ (A\ U • ■ • U A q U B\ U . . . U B q ) be the 
set of leftover vertices. So, IS"! < 2k. Consider the graph G' = G — S. Since d(G, Ai) = d(G, Bi) and 
V(G)\V(G') = S, we have that \d(G' , Ai) - d(G' , B,)\ = \d(G,A i ) + e(A i ,S)-d(G,B i )-e(B i ,S)\. 
Thus, 

\d(G',Ai)-d(G',Bi)\ < \d(G,Ai)-d(G,Bi)\+\e(Ai,S)-e(Bi,S)\ = \e(A h S)-e(B u S)\ < \Ai\\S\ < 2k 2 . 

Denoting = \d(G',Ai) — d(G',Bi)\, we have that < |^4i||5| < 2k 2 . Lemma [7] applied to G' and 
Ai, . . . , A q , B\, . . . , B q asserts the existence of disjoint vertex sets A, B in G, \A\ = \B\ = Yli=i 1^1) 
such that disc(^, B) <2k 2 = 2 lg 2 n. 

Next, we shall find two vertex sets A and B of G such that \A\ = \B\ = [n/2\ and disc(A, B) < 
^(A(G) — 5(G) + 1). Let G' = G if n is even, and G' = G \ v if n is odd, where v is any vertex of G. 
Thus G' has 2[n/2j vertices and A(G") - 5{G') < A(G) - 5(G) + 1. Let A, B be two disjoint vertex 
sets in G' with \A\ = \B\ = [n/2\ and minimum discrepancy. It remains to show that disc(A,B) < 
|(A(G') - 6(G>)). 

If A(G') = 5(G') , i.e. G' is regular, then disc(^4, B) = by Lemma[7j Hence we may assume A(G") — 
5(G') > 0. Assume, without loss of generality, that disc(^4,-B) = e(A) — e(B). Since YlaeA d(G' , a) = 
2e(A) + e(A, B) > 2e(B) + e(A, B) = Y,beB d (G' , b) there is a vertex a £ A and a vertex b G B with 
d(G', a) > d(G', b). For A' = (A \ a) U b and B' = (B \ b) U a we have 



e(A') - e(B') = e(A) - d(G' , a) - e(B) + d(G' , b) = disc(A B) - d(G' , a) + d(G' , b) < e(A) - e(B). 
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Due to the minimality of disc(A,B) it follows disc (A', B') = e(B') — e(A') and thus 

2disc(A, B) < disc(A', B') + disc(A, B) = d(G' , a) - d(G', b) < A(G') - 5(G'), 
which implies disc(A, B) < |(A(G') - 5{G')) as desired. □ 

4. Proof of Theorem [4] 

This argument is very similar to the one used by Caro and Yuster. The only difference is that 
we are using stronger discrepancy results. We shall remove a few vertices of large degree. In the 
remaining graph we find two sets T and S, such that there are no edges between T and S, S is 
large enough, and T induces a matching and independent vertices. Finding "almost" twins guaran- 
teed by Theorem[3]in G[S], and using the edges and vertices of T to balance those, gives us twins in G. 

Let G be an n-vertex graph on e edges. We shall find large twins A and B of size at least 
n ^ — 20v ^ lg " ^ each. To shorten the expressions we use / = f(n, e) = v ^ ? | sn , so we are looking 
for twins of size at least f (1 — 20/) each. In particular we may assume that n(l — 20/) > 0. Assume 
that the size of the largest independent set in G is less than n(l — 20/), otherwise we find the desired 
twins as subsets of this independent set. Assume also that e > 4. 

Consider the set L = L(x) of vertices of degree at least x = ^j. The number of edges incident 
to these vertices is at least \L\x/2. Since this number is at most the total number e of edges, \L\ < 
^ = nf. Let G 1 = G — L, a graph with maximum degree less than x and number of vertices at least 
nil — /). Next we shall consider G' only. 

Let I = 2^p- = 21g 2 n. For a vertex v, denote by N[v] the set of vertices consisting of v together 
with its neighbors in G' . For an edge eo = uv, denote by A^[eo] the set of vertices {u, v} together 
with each of their neighbors in G' . We shall choose a set T of 41 vertices from V = V(G') inducing a 
matching e%, . . . , e\ and 21 independent vertices v\,...,v% via a simple greedy procedure: 

Step 1. Pick an edge e\ with endpoints in V. 

Since \V\ > n(l — /), V can not be an independent set, so the edge e\ exists. 
Step i, 1 < i < I. Pick an edge ej with endpoints in V \ (N[ei\ U • • • U iV[ej_i]). 

Since the size ofV \ (N[ei] U • • • U N[e;_i]) is greater than n(l - /) - 2(i - l)x > n(l - /) - 2lx = 
nil — f) — 2 ■ 2 = n(l — 9/) > 0, this set is not an independent set and an edge e« can be 

selected for i = 1, . . . , I. 

Step / + 1. Pick a vertex v\ from V \ [N[ex] U • • • U N[ei}). 

Step l + j, 1 < j < 21. Pick a vertex Vj from V \ (N[ei\ U • • • U N[e t ] U N[vi] U • • • U N[vj-i]). 

The fact that this set is non-empty for each j follows from the argumentation below. 

Let S be the last set S = V\ (JV[ei] U • • • UN[e t ] UN[vt] U • • • UN[v 2l \). Then \S\ > n(l-f)-Alx = 

n(l — 17/) > 0. Moreover no vertex of S is adjacent to any vertex of T. Applying Theorem [3] to 

G[S] gives two disjoint subsets of S, A and B, of the same size at least (\S\ — 2lg\S\)/2 and with 

discrepancy 7, 7 < 21g 2 (|5|) < 21g 2 n < I. Assume, without loss of generality, that e(A) > e(B). We 
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construct a set A' by adding 27 < 21 independent vertices from T to A, and a set B' by adding 7 < Z 
edges from T to i?. Now, the discrepancy of A' and B' in G is zero since there are no edges between 
A U B and T. Thus A' and 5' are twins in G of size at least (|5| - 21g |5|)/2 > f (1 - 17/) - lgn = 
2(1- 17/)- ^> i(l-19/). " □ 

5. Proof of Theorem [5] 

Let G be a graph on n vertices, where n is even. We shall show that if one of conditions [TJP)] holds 
then t(G) = n/2, i.e., the vertex set of G can be perfectly split into twins. 

[T)| Assume that the degrees of G form a set of consecutive integers. That is, for every integer 
i, 8(G) < i < A(G), there exists a vertex of degree i. Let A and -B be sets that have equal 
cardinality that partition the vertex set and have the smallest discrepancy. If the discrepancy 
is nonzero, assume that e(A) < e(B). In particular, we see that if A has a vertex of degree d 
then B does not have a vertex of degree d + 1 (otherwise swapping the two will give a partition 
with smaller discrepancy). Similarly, if B has a vertex of degree d, then A has no vertex of 
degree d — 1. So, if d is the smallest degree of a vertex in A then ^4 contains all vertices of 
degrees d + 1, d + 2, . . . , A(G). Similarly, if is the highest degree of a vertex of B, then B 
contains all vertices of degree dl — 1, dl — 2, . . . , 6(G). Moreover <i' < d. So, we have that the 
sum of the degrees in A is strictly greater than the sum of the degrees in B, a contradiction 
to the assumption that e(A) < e(B). 

[2)| Assume that the number of vertices of degree i is even for each integer i, 8(G) < i < A(G). 
Observe using Lemma [7] that it is sufficient to partition the vertex set into two parts of equal 
sizes such that the degrees of vertices in each part have the same sum. Since each degree 
appears an even number of times, we put half of the corresponding vertices in one part and 
half in another. 



3) Assume that n > 90 and \{i : \V{\ is odd}| > n/2. For each degree d, put [d/2\ vertices of 
degree d in one part and [d/2\ vertices of degree d in another part. Let S be the set of remaining 
vertices of distinct degrees. We shall split S also using Theorem [9} Formally, for each degree 
d, let Vd = AdU BdU Sd, where \Ad\ = \B d \ and \Sd\ < 1. Then S = SsU- ■ -USa is an even set 
of more than n/2 vertices of distinct degrees. Applying Theorem [9] to the degrees of vertices 
in S, we can split S in two parts, U and V of equal sizes such that \d(U) — d(U')\ < 1. Let 
A = U U Ut 5 A d and B = U'U Uj =(5 B d . Since d(V) is even and d (\jts A d) = * (l)Ls B d) , 
then d(U) and d(U') must have the same parity, and therefore must be equal. So it must be 
the case that disc(A, B) = 0. 



4) Assume that there are least A — 5 disjoint pairs of vertices whose degrees differ by exactly 
1. With x = A — 5, there is a set of vertices {vi,v[, . . . ,v x ,v' x } such that d(v{) = d(v'A — 1. 
Partition V(G) — {vi, v[, . . . , v x , v' x } into two sets A and B, of equal cardinality with < 
d(A) — d(B) = x' , x' < x. Note that this could be done greedily by ordering the vertices in the 
non-increasing order of their degrees and alternately adding one vertex to A and the next to 
B. Let A' = Au{vi, . . .,v x >} and B' = BU{v[,.. .,v' x ,}. As a result, d(A') = d(B'). If x' ± x, 
let A" = A' U {vi : x > i > x' , i is even} U {v[ : x > i > x' , i is odd} and B" = B' U {v • : x > i > 
x',i is even} U {vi : x > i > x',i is odd}. Consequently, \d(A") — d(B")\ < 1. However, since 



the sum of the degrees of a graph is even and V(G) = A" U B" , we have that d(A") = d(B"). 
Therefore A" and B" are twins of size n/2 each. We note that this technique is reminiscent of 
the one used by Ben-Eliezer and Krivelevich in [9]. 

□ 

6. Proof of Theorem [6] 

First, we observe that we may assume that no component of our forest consists of any isolated 
edges. If it does, remove those edges and find the twins in the remaining graph and delete 0, 1 or 2 
vertices to get twins A, B. We can add the endvertices of each isolated edge the twins - one to A and 
one to B - to obtain twins for the whole forest, apart from the at most 2 vertices that were already 
deleted. 

In order to proceed, we need to define a type of twins with additional structure. Let G = (V, E) 
be a forest and A, B be twins. We say that A U B is the set of colored vertices and G[A U B] is the 
colored graph; the remaining vertices are uncolored. We define A and B to be good twins if 

• each uncolored vertex has degree at most 1 in G, 

• the set of colored vertices induces no isolated vertices, 

• all colored vertices except for a set S C A of at most two vertices have all neighbors colored, 

• S contains at most one leaf and at most one non-leaf of the colored graph, 

• if S contains a non-leaf, w, of the colored graph, then w has at most one neighbor in A, 

• if S contains a non-leaf w and a leaf v of the colored graph, then v has smaller number of 
uncolored neighbors than w. 

Because each uncolored vertex has degree at most 1 and there are no isolated edges, if A,B are good 
twins then at least one vertex of each edge is colored. Therefore, the set of edges that have exactly 
one colored vertex come in at most two disjoint stars, each centered at a vertex in S C A. We refer 
to the middle and the right of Figure [T] for examples of good twins. The figure exemplifies one step in 



the inductive proof of existence of good twins (Theorem 10). 






G 



good twins in G' 



good twins in G 



Figure 1. Illustration of the proof of Theorem 10. Left: A forest G and a vertex u 



adjacent to at least one leaf and at most one non-leaf. Middle: Good twins in the 
graph G obtained from G by deleting all leaves adjacent to u. Right: Good twins in 
G obtained by recoloring u and v and applying a (u, v)-move followed by a (w, v)-move 
- c.f. Case 3.1.b. We use red circles to indicate members of A and blue squares to 
indicate members of B. 



Theorem 10. In every forest G there exist good twins. 
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Proof. Let G = (V,E) be a forest. We do induction on the number of edges of G. If \E\ = 0, then G 
consists of isolated vertices only and good twins of G are given by A = B = 0. 

For the induction step let u be any vertex in G that is adjacent to at least one leaf of G and to at 
most one non-leaf of G. We apply induction to the subgraph G' of G obtained by deleting all leaves 
adjacent to u and obtain good twins A, B of G' . Note that u is either a leaf or an isolated vertex of 
G' . 

What follows is a case analysis. In each case we will argue how to modify the good twins A, B of 
G' so as to obtain good twins of G. The modification of twins in each case is done in one or two steps 
called moves that are defined for a pair of vertices. 

Let X and Y be two sets of vertices, x £ X , y £ Y , for any vertex v, L(v) = N(v) \(XUY). If 
L(x) and L(y) are disjoint, then an (x, y)-move with respect to X, Y (or an (x, y)-move, where 
the sets X and Y are understood) is an operation creating a pair of sets X',Y' as follows: 

• Let L'(x) C L(x) and L'(y) C L(y) be sets with \L'{x)\ = \L'(y)\ = mm{\L(x)\, \L(y)\}. 

• Let X' = X U L'{y) and Y' = Y U L'(x). 

Note that if X and Y are twins and L(x) and L(y) contain only leaves of G, the (x,y)-move 
creates twins again. Moreover, all neighbors of at least one of x, y are contained in the new twins 
X' U Y'. See Figure [2] for an illustration. We shall see an (x,y)-move as a map / that takes two sets 
of vertices X, Y and two vertices x £ X, y 6 Y and maps these to X', Y', defined as above, i.e., 
f(x,y,X,Y) = (X',Y'). 




L(x) L(y) L'{x) L'(y) 

Figure 2. An (x,y)-move with respect to the sets X of red vertices and the set Y of 
blue vertices. 

Let S C A, \S\ < 2, be the set of those vertices in G' that have uncolored neighbors. Then in G the 
set of non-leaf vertices with uncolored neighbors is given by S U u. We distinguish the cases \S\ = 
(Case 1), \S\ = 1 (Case 2) and \S\ = 2 (Case 3), and the subcases u £ A, u G B and u ^ A U B. We 
apply (x, y)-moves to twins only, with {x, y} C SUu, each time decreasing the number of vertices that 
have uncolored neighbors by at least one. In most of the cases we end up with at most one vertex x 
with uncolored neighbors, and to see that these twins are good twins it then suffices to check that x 
is incident to at most one edge in G[A] or G[B]. To be precise, we may have to swap the roles of A 
and B in the case where x £ B. However, due to simplicity of presentation we omit to mention this 
explicitly. 

In many cases we do not apply (x, y)-moves immediately. Instead, we first color some uncolored 
vertices and/or recolor some leaves of the colored graph. In the first case, we always add to A some 
vertex that has no neighbor in A and to B some vertex that has no neighbor in B, i.e., we get 
twins again. If we recolor a leaf x, say x £ A, in the colored graph this has the following effects: 
I .A \ x\ = \B U x\ — 2 and e(A \ x) = e{B U x) — 1. We repair this imbalance by either recoloring a leaf 
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y G B in the colored graph or adding two uncolored vertices to A, one with no neighbor in A and the 
other with exactly one neighbor in A. See Figure [3 (d)| and 3(e) for an example. 

• Case 1 - \S\ = 0: If u G' A U .B, then rt is an isolated vertex of G 1 and good twins of G are 
given by ^4 U {«}, B U {x} for some vertex x G L(u). li u £ AL) B then A, are already good 
twins for G. In both cases, the new set S is {u}. 




(a) Case 2.1 - it G B: 





(b) Case 2. 2. a - it G L(w): 

(^',-B') = /(iu,ti,AUa:,BUit) 




y x y 

(d) Case 2. 3. a - u € A, \L(w)\ < \L(u)\ 
and its colored neighbor is in A: 
(A', B') = /(to, it, (i4\«)U{a; > 2/}, Bum) 




(c) Case 2.2.b - it has no colored neighbor: 
{A'., B') = f(w, u,AUx,BUu) 




y x y 

(e) Case 2.3. b - it G A, \L(w)\ < \L(u)\ 
and its colored neighbor is in B: 
(A', B') = f(w, u, (A\u)U{x, y}, BUu) 



Figure 3. Illustration of Case 2 in the proof of Theorem 10 



Case 2 - \S\ = 1: Let S = {w} C A. We may assume, without loss of generality, that w is a 
non-leaf of GL4 U B]. Next, we distinguish several cases depending on whether u £ A, u G B, 
or u A U B. Figure [3] shows for each case how to obtain good twins for G by (re)coloring 
some vertices and applying a (w, n)-move. The procedure is formally described as follows: 

— Case 2.1- u G B: We perform a single (w, u)-move. 

— Case 2.2- u AuB: Either u G L(w) or u was isolated in G' . In either case, we first add 
u to B and any x G £(it) to A We obtain twins again, to which we perform a (w, ii)-move. 
If | ) | > |L(u)|, then the set S remains {w}. If |£(iw)| < then the roles of A 
and B are switched and S becomes {u}. If |£(u>)| = |L(w)|, then 5 = 0. 

— Case 2.3 - u G A: If to is a non-leaf of G'[A U B] and \L(w)\ > \L(u)\, then A, B 
are already good twins of G and the new set S is {w,u}. In fact, we may assume that 
1-^(^)1 < l-^( n )|) since if w is a leaf of CL4U-B] and |£(u>)| > |L(n)|, we can swap the roles 



of w and u. First, we recolor u (remove it from A and add it to B), add some x G L(w) as 



well as some y G L(u) to A, and perform a (w, u)-move to the resulting twins. Figure 3(d) 



and 3(e) depicts the case that the colored neighbor of u is in A and B, respectively. 
In Case 3 below when we recolor a vertex x we do not distinguish the subcases whether the 
colored neighbor of x is in A or B, since the treatment is the same. We indicate in Figure [4] 
the first subcase only. 




(c) Case 3.2.a - u 6 L(w), \L(w)\ < \L(u)\ + \L(v)\: (d) Case 3.2.b - u G L{w), \L(w)\ > \L(u)\ + \L(v)\: 

(A', B') = f(w, v, f(w, u, {A\v)u{x, y, z}, Bu{u, v})) (A', B') = /(to, u, A U x, B U u) 

(The same is done if u has no colored neighbor.) (The same is done if u has no colored neighbor.) 




(A', B') = f(w, u, f(v, u, (A\u)u {x, y}, B u u)) 



Figure 4. Illustration of Case 3 in the proof of Theorem 10 



Case 3 - \S\ = 2: By the definition of good twins we have S = {v,w} C A, v a leaf and w 
a non-leaf of G'[AU B], and \L(v)\ < \L(w)\. We distinguish, respectively, whether u £ B, 
u £ L(w), u G -L(w) or u G A. We further distinguish subcases depending on the sizes of L(u), 
L(v) and L(w). The treatment for each case is shown in Figure [4j the description follows. 
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— Case 3.1. a- u £ B and \L{u)\ > \L(v)\: We perform a (v,u)-move and afterwards a 
(w, u)-move. Note that if |£(it)| = |L(t>)|, i.e., u has no uncolored neighbors after the 
(v, u)-move, the second move has no effect. 

— Case S.l.b- u £ B and \L(u)\ < \L(v)\: In particular, this means |L(it)| < Then 
we swap the colors of u and v, i.e., remove u from B and add it to A as well as removing 
v from A and adding it to B. Since both u and v are leaves in the colored graph, we 
obtain twins again. Then apply a (u, v)-move followed by a (w, v)-move. 

— Case 3. 2. a- either u £ L(w) or u was isolated in G' and \L{w)\ < \L(u)\ + \L(v)\: We add 
u to B and some x £ L(u) to A, obtaining twins again. If |L(u>)| < |L(u)|, then perform 
a (if, ti)- move followed by a (v,u)-move. If |£(tL>)| > |L(u)|, then, recolor v and add some 
y £ L(u) (if = 1 take some y £ L{v) instead) and z £ L{w) to A. To the resulting 
twins we apply a (w,u)-move followed by a (w,v)-move. 

— Case 3.2.b- either u £ L(w) or u was uncolored and isolated in G' and \L{w)\ > \L(u)\ + 

We again add u to B and some x £ L(u) to A. To the resulting twins apply a 
(u;,-u)-move only. Note that in this case, both v and w still have uncolored neighbors. 

— Case 3.3. a- u £ L(v) and \L(v)\ < \L(u)\: We add u to B and some x £ L(u) to A, and 
then perform a (v , u)-move followed by a (u>, u)-move. 

— Case 3.3.b- u £ L{v) and \L(v)\ > \L{u)\: We recolor v and compensate the imbalanced 
edge-count and vertex-count by adding u as well as some x £ L(u) to A. We perform a 
(u, u)-move followed by a (w, v)-move. 

— Case 3.4- u £ A: Assume, without loss of generality, that > |£(f)| (otherwise swap 
the roles of u and v). We recolor u and add some x £ L{u) as well as some y £ L(v) to A. 
This produces twins to which we perform a (v,u)-move and afterwards a (w, n)-move. 

It is straightforward to check that our treatment in each of the above cases results in good twins 
A',B' of G. The formal definition of (A',B f ) in Case 2 and Case 3 is given in Figure [3] and Figure [4j 
respectively. □ 

Having Theorem |10[ we can prove Theorem [6] quite easily. 



Proof of Theorem^ Let G = (V,E) be a forest. By Theorem 10, G admits good twins A, B. The 
vertices in V\ (AL)B), i.e., the uncolored vertices, come in three groups, each of which may be empty. 
The first group S± contains all isolated vertices of G. The second group £2 consists of some leaves of 
G that are all adjacent to a leaf v £ A of G[A U B\. The third group S3 consists of some leaves of G 
that are all adjacent to a non-leaf w £ A ol G[A U B]. Moreover, if S3 7^ 0, then IS3I > |S2|. 

• Case 1 - S2 = S3 = 0; Partition Si into equal-sized subsets by removing at most one vertex, 
and add one subset to each of A, B. 

• Case 2 - exactly one 0/S2, S3 is non-empty: We will delete the corresponding vertex v or w 
from A, say it's w. I.e., we will uncolor w but color some of its neighbors. 

If w has no neighbor in A, then uncolor w and add all but at most one of the remaining 

uncolored vertices to A and to B evenly so that the resulting sets are twins of size [n/2] — 1. 
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So, we may assume that w has a neighbor in A (exactly one, by the condition of twins being 
good). 

It is easy to see that G has a leaf in B. For example, contracting all edges in G[A] and in 
G[B] and removing the uncolored vertices in S gives a forest G whose bipartition classes are 
induced by A and B and are of equal size. Every leaf of G in B corresponds to at least one 
leaf of G in B and every component of G with at least as many vertices in B as in A has a leaf 
in B. So let u € B be a leaf of G. Then uncolor w and recolor u to be in A. Add all but at 
most one of the remaining uncolored vertices to A and to B evenly so that the resulting sets 
are twins of size \n/2\ — 1. 
• Case 3 - S2 7^ and S3 7^ 0: If w has a neighbor in A, then add some vertex x £ S2 to A If 
w has no neighbor in A, then and some vertex y £ S3 to A. Uncolor w. These are twins again. 

Next, add all vertices in S2, except possibly x, to B, and add the same number of vertices 
from S3 to A. (Note that IS3I > IS2I and hence IS3 \ y\ > IS2IO in the resulting twins, no 
uncolored vertex has a colored neighbor and thus we can find twins of G by removing at most 
one further vertex. In any case, the graph G contains at most 2 uncolored vertices and the 
colors form twins. 

This completes the proof. □ 
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